Dilated-Scale-Aware Category-Attention ConvNet for Multi-Class Object Counting
نویسندگان
چکیده
Object counting aims to estimate the number of objects in images. The leading approaches focus on single category task and achieve impressive performance. Note that there are multiple categories real scenes. Multi-class object expands scope application task. multi-target detection can multi-class some scenarios. However, it requires dataset annotated with bounding boxes. Compared point annotations mainstream issues, coordinate box-level more difficult obtain. In this paper, we propose a simple yet efficient network based point-level annotations. Specifically, first change traditional output channel from one multiclass counting. Since all use same feature extractor our proposed framework, their features will interfere mutually shared space. We further design multi-mask structure suppress harmful interaction among objects. Extensive experiments challenging benchmarks illustrate method achieves state-of-the-art
منابع مشابه
Multi-scale Location-aware Kernel Representation for Object Detection
Although Faster R-CNN and its variants have shown promising performance in object detection, they only exploit simple first-order representation of object proposals for final classification and regression. Recent classification methods demonstrate that the integration of highorder statistics into deep convolutional neural networks can achieve impressive improvement, but their goal is to model w...
متن کاملMulti-scale Cortical Keypoint Representation for Attention and Object Detection
Keypoints (junctions) provide important information for focus-of-attention (FoA) and object categorization/recognition. In this paper we analyze the multi-scale keypoint representation, obtained by applying a linear and quasi-continuous scaling to an optimized model of cortical end-stopped cells, in order to study its importance and possibilities for developing a visual, cortical architecture. ...
متن کاملVisual Attention Models of Object Counting
We develop a sequential learning model using a recurrent neural network architecture and reinforcement learning to recognize and count objects in images. Simple feedforward neural networks perform well on this task when trained using backpropagation; however, convolutional neural networks are computationally expensive and results are less certain when the image input has imperfect resolution ou...
متن کاملLarge-Scale Category Structure Aware Image Categorization
Most previous research on image categorization has focused on medium-scale data sets, while large-scale image categorization with millions of images from thousands of categories remains a challenge. With the emergence of structured large-scale dataset such as the ImageNet, rich information about the conceptual relationships between images, such as a tree hierarchy among various image categories...
متن کاملA New Class of Multi-Category Learning for Multiple Video Object Extraction
Video object (VO) extraction is of great importance in multimedia processing. In recent years approaches have been proposed to deal with VO extraction as a classification problem. This type of methods calls for state-of-the-art classifiers because the performance is directly related to the accuracy of classification. Promising results have been reported for single object extraction using Suppor...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Signal Processing Letters
سال: 2021
ISSN: ['1558-2361', '1070-9908']
DOI: https://doi.org/10.1109/lsp.2021.3096119